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Abstract. We study a model of growing planar tree graphs where in 
each time step we separate the tree into two components by splitting a 
vertex and then connect the two pieces by inserting a new link between 
the daughter vertices. This model generalises the preferential attach- 
ment model and Ford's Q-model for phylogenetic trees. We develop 
a mean field theory for the vertex degree distribution, prove that the 
mean field theory is exact in some special cases and check that it agrees 
with numerical simulations in general. We calculate various correlation 
functions and show that the intrinsic Hausdorff dimension can vary from 
one to infinity, depending on the parameters of the model. 
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1. Introduction 

1.1. Motivation. Random trees arise in many branches of science ranging 
from mathematics through computer science, physics and chemistry to bi- 
ology and sociology. Trees are the simplest generalisations of random walks 
and are used to model various relationships like family trees or phylo- 



genetic trees [2l|, |24] and evolving populations. They also model physical 
objects that look like trees, e.g. branched polymers, and are used to encode 
information regarding the secondary structure of macromolecules, RNA fold- 
ing being one prominent example [l4'| . Information about more complicated 
geometrical objects like planar maps and random surfaces which arise in 
quantum gravity 0| can sometimes be encoded in labelled random trees [l^. 
Trees appear also in relation with fragmentation and coagulation processes 

There are two principal classes of models that have been used to model 
trees. The first one is equilibrium statistical mechanics, where trees are 
assigned a certain weight factor (Boltzmann factor) which induces a prob- 
ability measure on the class of trees under consideration. The weight factor 
depends on the local properties of the trees. The second class is growing 
trees where one starts with a simple tree and then continues to add vertices 
and links according to some (in general stochastic) growth rules. After a 
fixed time t the growth rules induce a probability measure on the trees that 
can arise after time t. Time can be taken to be discrete or continuous. The 
trees themselves are in general discrete and are therefore tree graphs in the 
usual sense, but there exist also models of continuous random trees p]. 

The first class of models is the one that appears most frequently in physics 
whereas the second class is the natural one to use to analyse, for example 
growing family trees, citation networks, and stochastic processes. It is well- 
known that in certain cases these two approaches to studying random trees 
are equivalent. For example, the so called generic random trees that appear 
in the study of gravity are equivalent to a critical Galton- Watson process 
[l5| . So called causal trees are believed to be equivalent to certain classes of 
growing trees 0] . 

In this paper we introduce and study a general model of growing trees, 
where growth occurs by random vertex splitting, with very general rules. 
This model includes as special limit cases several models of growing trees al- 
ready studied in the physics and the mathematics literature: the well known 
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preferential attachement model (which models for example the growth of 
branched polymers out of a soup of monomers or the growth of the internet 
[H, [13]); the a- model 16] which is a stochastic model of cladograms (binary 



leaf-labelled trees studied in relation with phylogenetic trees and biological 
systematics) and its extensions [l3] • It is also related to a tree growth model 



that arises in the theory of RNA secondary structures [IJ]. These discrete 
time tree growth models are in fact closely related to the continuous time 
tree growth models which describe e.g. self similar fragmentation processes 
and coalescence processes (see 0] for general reference, and for re- 

cent works), and seem to be related to sequential packing processes. Finally, 
back on the physics side, let us mention that growth process by vertex split- 
ting, together with vertex merging processes, is important in Monte Carlo 
simulations of discrete Euclidean and Lorentzian quantum gravity models 
(see e.g. jl]). 

Our main motivation is to develop general tools to study the properties of 
models of random tree growth. In particular we are motivated by the issues 
of unification and of universality: Is there a general tree growth process 
which can encompass the different models which are known at the moment? 
How many different universality classes, i.e. continuous tree models with 
different scaling properties (exponents and correlation functions), exist in 
this framework? The results presented here are a first step in this direction. 

1.2. Description of the model. Let us first describe our model infor- 
mally, a precise definition will be given in the next section. Suppose that 
at an initial time to we are given a tree which will always be assumed to be 
finite, rooted and planar (i.e. at each vertex the attached links are ordered 
cyclically). This assumption of planarity is convenient for the description 
of the model and for the proofs, but is not essential. In order to evolve the 
tree we choose a vertex at random with a relative probability Wi which only 
depends on the order i of the vertex. We split the chosen vertex into two 
vertices of order j and k which are linked to each other and attach the links 
of the original vertex of order i to the two new vertices so j -|- A; = i -|- 2 
and the planarity is preserved in this process, see Fig. [H We sometimes 




Figure 1. The vertex splitting process, 
let the relative probability of this splitting, wj^k, depend on j and k but 
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sometimes we take the miiform distribution on the i{i + l)/2 possibihties. 
The parameters Wi and wj^k define the model. We shah consider both the 
case when there is a maximal vertex degree dmax, i-e., wj^k = if either j 
or k exceeds the cutofi^ dmaxj and the general case, but most of our explicit 
results will be for a finite c?max- 

This growth process becomes the preferential attachment growth model 
[H, when we take wj^k = unless j or k is equal to 1. As already 
mentioned, similar processes of random vertex splitting and random vertex 
merging are used as ergodic moves in Monte Carlo simulations of triangu- 
lations in quantum gravity, see Fig. [2j The connection with the models of 



We are interested in the structure of the trees after a large number of steps 
in the growth process. The main quantities we study are the distribution 
of the order of vertices, the correlation between the degrees of neighbouring 
vertices and the intrinsic Hausdorff dimension of the trees. In order to 
get at the Hausdorff dimension we shall define and analyse certain general 
correlation functions which describe the subtree structure of the random 
tree. 

The calculations are simpler when the splitting weight wj depends linearly 
on j (wj = aj + b) because then the probability pj of splitting a vertex of 
degree j in a tree T is given hy pj = Wj /F(\T\) where F is a function only of 
the number of vertices \T\ in T but does not depend on the internal structure 
of T. In the general case our results will rely on a physically plausible, but 
still unproven, mean field assumption. We shall provide numerical evidence 
for this assumption. 

1.3. Results and organisation of the paper. In section [2] we first give 
the precise definition of our vertex splitting model. We then study the special 
case where the splitting probability weights are linear with the initial vertex 
degree i and focus on the vertex degree distribution. 

In section 12.21 we write exact recurrence equations for the general local 
vertex degree probability distributions. Using the Perron- Frobenius theorem 
[25! we show that the single vertex degree probability distribution p = {pk} 
{pk is the density of vertices with a given degree k) has a well defined limit 
as the size of the tree goes to infinity. We furthermore show that p = {pk} 
is given by an eigensystem equation of the form Bp = Ap, where B is a 
matrix depending on the weights of the model. The proof depends on the 





Figure 2. Vertex splitting (left to right) and vertex merging 
(right to left) in a triangulation. 
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matrix B being diagonalizable. Similar techniques have been used to find 
the asymptotic degree distribution in random recursive trees . 

In section [2^ we find explicit solutions to the above eigensystem equation 
in special cases when the matrix B is diagonalizable. We study in almost 
full generality the cases when dmax = 3 and dmax = 4 and a particular choice 
of weights for arbitrary dmax- In section [2l4l we show that the condition of 
diagonalizability of B is not very restrictive. 

In section 12.51 we relax the condition of linearity on the splitting weights 
Wi. We argue that mean field theory is still valid and that the degree proba- 
bility distribution p is still given by the same linear eigensystem equation as 
in the linear case. We give good numerical evidence of the validity of these 
mean field equations for dmax = 3 trees. 

For infinite dmax and linear and uniform splitting probabilities we can still 
calculate the vertex degree distribution in closed form using mean field the- 
ory. This is done in section [2^ where we show that it agrees with numerical 
simulations. The vertex degree distribution is found to fall off factorially in 
this case. The vertex degree distribution always becomes independent of the 
initial tree as time goes to infinity. 

In section[3]we study probabilities associated to the local subtree structure 
of the tree, as seen from any vertex, and as a function of its creation time s. 
More precisely, we are able to write recursion relations for the probability 
Pfc(^i) ■ ■ ■ j^fcj •s) that the vertex created at time s is of degree k, with the k 
subtrees with fixed respective sizes ii, - • • ,ik- 

In section 13.11 we write the detailed (and quite complicated) recursion 
relations in the linear splitting weights case, and we show that they simplify 
for the probabilities j^fc) obtained by summing over the time of 

creation. 

In section [3^2] we derive a recursion relation for a simple two-point function 
Qkii^i,^2) related to the decomposition of a tree into two subtrees, that will 
be useful. The substructure probabilities should find several applications in 
the studies of these trees. 

In section |4] we study the scaling properties of the trees by computing 
their Hausdorff dimension (or intrinsic fractal dimension). 

In section 14.11 we define the radius and the fractal dimension dn of a tree 
(since there are several almost equivalent definitions). 

In section 14.21 we explain how the radius (hence the fractal dimension) is 
obtained from the two-point function defined in 13.21 

In section 14.31 using a natural scaling hypothesis for the two-point func- 
tion, we show that the fractal dimension dn is also given by the solution 
of an eigensystem equation of the form Cu: = W2/dH^, where C is a com- 
plicated matrix which is a function of weights of the model. We use a 
Perron-Frobenius argument to prove that this eigensystem equation has a 
unique physical solution. 

In section 14.41 we establish bounds on the fractal dimension. We show 
that it can vary continuously with the splitting weights between 1 and +oo. 
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In section 14.51 we study the case of (dmax = 3) trees with hnear uniform 
sphtting weights. We compare the analytical result for dn with numerical 
simulations and show that the agreement is good. 

In section 14.61 we relax the condition of linearity on the splitting weights 
Wi. We argue that mean field theory is still valid and that the fractal di- 
mension dn is still given by the same linear eigensystem equation as in the 
linear case. We present preliminary numerical evidence of the validity of 
these mean field equations for binary trees. 

In section Owe study the correlations between the degrees of neighbouring 
vertices. This amounts to studying the density pij of links with vertices of 
degrees i and j. 

In section 15.11 we write general equations for these correlations in the 
linear splitting weight case. Some technical details are left to appendix [Al 

In the simple case of dmax = 3 trees these correlations can be calculated 
explicitly, and compared with numerical simulations. This is done in section 
5.2\ and one sees that there are always nontrivial correlations, i.e., pij ^ 
PiPj/{l — pi)~^ where pi is the density of vertices of degree i (the r.h.s. 
obtained if the degrees of neighbouring vertices are completely independent). 
Correlations between vertices which are further away from each other can 
be studied by the same method. We show that our results for the degree- 
degree correlations are different from those for a different class of statistical 
random graphs [9]. 

In section 15.31 we extend our results for the case of non-linear splitting 
weights, assuming mean field theory. We show that there is a very good 
agreement between our analytical results and numerical simulations (still in 
the case of dma.x = 3 trees). 

Finally section [6] is the conclusion. We discuss there in more detail the 
relationship between our model and other models of random trees, in par- 
ticular models of phylogenetic trees [1, 0, 12, [H, IS, [12], and we point out 
some open questions. 



2. The vertex splitting model 

2.1. Description of the model. A rooted planar tree is a tree graph em- 
bedded into the plane which contains a distinguished vertex that we call 
the root. We will always assume that the root has order one. The planarity 
condition is for convenience only and can also be implemented by cyclically 
ordering the links incident on each vertex. 

Let T be the collection of all rooted planar trees for which every vertex 
has finite degree at most 

c^max = d maximal vertex degree. (2-1) 

We shall discuss later the case dmax = oo. Denote the number of vertices in 
a tree T £ T hy \T\ and the number of vertices of degree i in T by ni{T). 
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Let T/v be those trees T £ T with \T\ = N. Let 
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be a symmetric matrix with nonnegative entries that we call partitioning 
weights. We define a collection of non-negative numbers called splitting 
weights, wi,W2, ■ ■ ■ ,Wd, by 



Wi 



i+l 



(2.2) 



^Wj,i+2-j. 

We now define a growth rule for planar trees which we call vertex-splitting. 
Given a tree T G T/v 

(i) Choose a vertex v ol T with probability Wi/W{T) where i is the 
order of v and 

d 

W(T) = Y,w,n,{T). (2.3) 

i=i 

(ii) Partition the edges incident with v into two disjoint sets V and V 
of adjacent edges with probability 

Wk,i+2-k 



Pk,i+2~k 



Wi 



(2.4) 



The set V contains — 1 of the edges and V' contains i — {k — \) 
of these edges, /c = 1, . . . , i. For a given k, all such partitionings are 
taken to be equally likely, 
(iii) Move all edges in V' from u to a new vertex v' and create an edge 
joining v to v' . If v is the root, then the new vertex of order one is 
taken to be the root. 

This vertex-splitting operation is illustrated in figure [3] (the root vertex is 
circled) . 

After the splitting operation, the degree of vertex v \s k and the degree of 
vertex v' \s i + 2 — k. Since the maximum allowed vertex degree is d we define 
Wd+i,i = wi^d+i = 0, i.e. we do not allow splittings of vertices of degree d 
that produce vertices of degree d + 1. If the partitioning weights are chosen 
such that Wi^j = for i 7^ 1 or j 7^ 1, then the vertex-splitting model is 



equivalent to the preferential attachment model discussed in [Ic 

In Sections 3 and 4 we will find it convenient to label the vertices according 
to their time of creation. In this case we append the following to our rules: 
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(iv) Let a be the label of the vertex chosen in (i). If v is further away 
from the root than v' in step (iii) then we let v keep the label a and 
give v' the label N. Otherwise label v with TV" and label v' with a. 

This book-keeping device has no effect on the dynamics of the model. The 
single root vertex (which is the only tree in 7i) has label 0. We will often 
think of the number of vertices (or equivalently, the number of links) as time 
and denote it by t + 1 (or t in the case of links) assuming we start with the 
single vertex tree at time t = 0. 

If the partitioning weights are chosen such that the splitting weights are 
linear, 

Wi = ai + b (2-5) 

for some a,b > then the model is easier to analyze since the weight of a 
tree T eT depends only on the size of the tree 

>V(r) = (2a + 6)|r| -2a. (2.6) 

This is easily seen from the two constraints on the vertex degrees, 

d d 
J2ni{T) = \T\ and ^ mi(r) = 2(|r| - 1). (2.7) 

1=1 i=l 

By abuse of notation, in this case we will write W(|r|) = W(r). We will 
also sometimes restrict to uniform partitioning weights, i.e. 



' Wk/{^2^) for z = + if k < d, 

Wk/{1) for i = 2, ifk = d. 



(2.8) 



2.2. Distribution of vertex degrees for linear splitting w^eights. Start 
from a finite tree Tq at time to = \To\ and perform vertex splitting according 
to the rules described in the previous subsection r times. We then obtain a 
tree in %q+t- Let t = to + T. The vertex splitting operation induces a prob- 
ability measure fit on %, which of course depends on the initial tree Tq. In 
this section we will drop Tq from function arguments with the understanding 
that it is implied, unless otherwise stated. 
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Let pt{ni, . . . , Ud) be the probability that T G % has (ni(r), . . . , nd{T)) = 
(ni, . . . , Ud) according to the measure /xt. We wish to find the mean value of 
nk{T) for T £ Tt with respect to the measure fit- Denote this value by n^^fc- 
We define the vertex degree densities pt^k = ^t,k/t and with some conditions 
on the partitioning weights we will prove the existence of the limit 

lim pt^k = Pk 

and show that the pk satisfy a system of linear equations. 

Let x = (xi , . . . , Xd) € and define the probability generating function 

Wt(x) = Yl Pt{ni,...,nd)xl'---xy (2.9) 

ni+--nd=t 

Proposition 2.1. The probability genereting function Ht^x.) satisfies the 
recurrence 

Wm(x) = E c(x)-V(xr---x-'^) (2.10) 



niH \-nd=t 

for all t > to, where 

c(x) = (ci(x),C2(x),...,Cd(x)) (2.11) 

with 

i 

Ci(x) = -Ywj,i+2-jXjXi+2-j (2.12) 

and V = (d/d xi,. . . , d/dxd^ is the standard gradient operator. 

Proof. Any tree contributing to Ht+i can be obtained by splitting a vertex 
in a tree on t vertices. This process can be divided into three steps: 

(i) Choose a tree T e % with vertex degree distribution (ni, . . . ,nd) 
with probability pt{ni, . . . , rid). 

(ii) Select a vertex in T of degree i with probability riiWi/ njWj. 

(iii) Partition the edges incident to the chosen vertex into two sets V and 
V of adjacent edges with j — 1 and i + 1 — j elements, respectively, 
with probability iwj^i^2-j/wi if i 7^ ^ + 2 — j and with probability 
^Wj^i-\.2-j/wi \i j = i + 2 — j. In the latter case there is a symmetry 
between V and V which accounts for the factor 1/2. 

Multiplying together the probabilities in (i)-(iii) gives the probability of 

removing a vertex of degree i and creating two new vertices of degree j and 
i + 2 — j. In terms of the generating function this amounts to replacing 
x"^ • • • x^''- by x'^'^XjXiJ^2-jXi^ ■ ■ ■ x^lj^ . The probability is 



Mni,...,n,) ^^ 



iwj,i+2-j if J / i + 2 - j, 

(2.13) 

^Wj^i+2-j otherwise. 
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The partial derivative d/dxi in V takes care of removing a vertex of degree 
i and provides the factor m. In Ci(x), the factors XjXi+2-j add two vertices 
of degree j and i + 2 — j respectively and the appropriate weights are given. 
Now sum (I2.13P over j = + i.e. over all possible partitionings in 

(iii) , to obtain the sum in (I2.12P . Note that terms for which j ^ i + 2 — j 
appear twice in the sum, since Wp^q is symmetric in p and q, and the term 
for which j = i + 2 — j appears once. This explains the factor 1/2 in front 
of the sum in ()2.12p . The dot product of c(x) and V accounts for the sum 
over all vertex degrees, and finally sum over all vertex degree configurations 
in the initial tree to obtain (j2.10[ □ 

For linear weights (|2.5p . equation (j2.10p reduces to 

7^t+i(x) = :^c(x).VW<(x). (2.14) 

The remainder of this subsection concerns linear weights only. We have 

nt,k = ^ pt{ni,...,nd)nk = 5fcWt(x)|x=i, (2.15) 

ni+...+nii=t 

where 1 = (1, 1, ... , 1). To get a recursion equation for n^^^, differentiate 
both sides of (j2.14p with respect to x/^ and set x = 1 to find 



^ ( ^ iwk,i+2-knt,i + Y^Wididknt{^)\,c=i j 



(2.16) 



Since the weights are linear we can use the constraints in (|2.7p to rewrite 
the last term in (|2.16p as 

d 

J2^ididkHt{^)\^=i = {-Wk + W{t))nt,k. (2.17) 
1=1 

Inserting this into (j2.16p we see that the equations close 

nt+i,k = (^-Wknt,k + iwk,i+2-knt,i^ +nt^k- (2.18) 

We can also write the recursion in terms pt^k and find 
t 



Pt+l,k 



i-WkPt,k+ ^ iwk,i+2-kPt,i\ +t{pt,k - Pt+i,k)- (2.19) 
V i=k-l J 



The above equation can be put in the matrix form 

Pt+i = ^tPt (2.20) 
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where 



Pt = {pt,i, Pt,2, ■ ■ ■ , Pt,d)^ , At = + — 



t + i\ w{ty ^ 

(2.21) 



B = 



Wl,2 2W1,3 

W2,l 2'W2,2 

2to3,i 



(d-2)wi,d_i 

(d-2)w2,d-2 {d-l)w2,d-l dW2,d 
(d - 2)w3,d_3 (d - l)W3,<i-2 dM;3,d-l 



(d-2)M;d-i,i 




(d-l)Wd-l,2 dWd-1,3 

(d-l)wd,i dwd,2 



(2.22) 



and I is the identity matrix. 

If we denote the vertex degree densities of the initial tree Tq by p^o 
can write the densities for trees on t vertices which grow from the initial 
tree as 



(2.23) 



We will establish convergence of the right hand side by imposing some tech- 
nical restrictions on B. It turns out that the limiting distribution is in- 
dependent of the initial distribution p^^. We begin with some necessary 
lemmas. 

Lemma 2.2. //A is an eigenvalue ofB with corresponding eigenvector 
eA = {exi,- ■ ■ ,exd), «-e. 



BeA = Xex, 

then the following holds: 

d d 



and 



d 

A ^ iexi 
1=1 



1=1 



i=l 



(2.24) 

(2.25) 
(2.26) 



Proof. We prove the second identity. The first identity is established by a 
similar calculation. Multiply the z-th component of the eigenvalue equation 
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()2.24p by i and sum over i to get 

d d d d 

A ^ iexi = - ^ iWiCxi + ^ kwi^k+2-iexk 

i=l 1=1 i=l k=i—l 

d d /k+1 \ 

= -^iwiexi + exk- (2.27) 

i=l k=l \i=l ) 

Using Wi^j = Wj^i we find that 

k+i k + 2 ^"""^ 

^ iWi^k+2-i = ^ Wi^k+2-i (2.28) 

1=1 1=1 

and this together with the definition of the sphtting weights (j2.2p proves the 
identity. □ 

Lemma 2.3. // 

(1) Wk^i = wi^k > for k = 1, . . . , d (i.e. it is possible to produce vertices 
of maximal degree d from vertices of degree j < d) and 

(2) Wi^d+2-i > for at least one i with 2 < i < d — 1 (i.e. it is possible 
to split vertices of maximal degree d), 

then W2 is a positive, simple eigenvalue o/B. All other eigenvalues o/ B 
have a smaller real part. The corresponding eigenvector can be taken to 
have all entries positive. 

Proof. We begin by choosing a number 7 > iaiaxi<k<d{'Wk — kwk,2} and 
define P = B + 7I. The matrix P has only non- negative entries and the 
conditions (1) and (2) on B guarantee that it is primitive, i.e. there is a 
number k such that all entries of the matrix P^ are positive. Therefore, 
by the Perron- Frobenius theorem [25], P has a simple positive eigenvalue r 
and all other eigenvalues of P have a smaller modulus. The corresponding 
eigenvector can be taken to have all entries positive. We normalize the 
eigenvector such that 



]eri = l. (2.29) 

i=l 

Shifting back to the matrix B we find that vu = r — 7 is a simple real 
eigenvalue of B with the largest real part and the corresponding eigenvector 
is = ej.. We see right away from (j2.25p and with the chosen normalization 
that 



'^mewi- (2.30) 



w 

i=l 

Since the weights are linear. Lemma 12.21 shows that w = W2. □ 

Note that the first condition on the weights in the above lemma is natural 
since we have fixed a maximal degree d and therefore we want to be able to 
produce vertices of degree d. The second condition, however, does not seem 
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to be necessary for the results to hold but we still require it in order to use 
the Perron- Frobenius theorem for primitive matrices. This condition is not 
very restrictive in the case of linear weights since it holds for all a and b 
except when ad + b = 0. 

Lemma 2.4. Let A G C. Then 
as t ^ oo. 

Proof. The result follows from the identity 



, if A = W2, 

if Re(A) < W2 



(2.31) 



i=to 



r 




2a-A 




W2 


r 




2a\ 






W2 J 



r ( to - — 

I " W2 



W2 



□ 



Theorem 2.5. With the assumptions on B in Lemma \2.3\ and the addi- 
tional assumption that B is diagonalizable, the limit as t ^ oo of the right 
hand side of equation i2.23\) exists and is given by the eigenvector By^.^ of B 
normalized such that 

d 

^e^,i = l. (2.33) 

i=l 

Proof. We use the normalization in (j2.33p and expand pj^^ in the basis of 
eigenvectors of B. Using the results of Lemmas 12.21 and 12.31 and that Tq 
satisfies the equations in (j2.7p we see that the expansion is of the form 

W2to -2a 

Pto = T + 2^ aiSx^ (2.34) 

1=1 

where Aj, i = 1, . . . ,d — 1 are the eigenvalues of B with real part less than 
W2- The result now follows from Lemma 12.41 □ 

Theorem 2.4 shows that with the above conditions on B the limit of the 
vertex degree densities exists, is independent of the initial tree and is given 
by 

p = lim Pf = ew2- (2.35) 

n^oo 

The limiting densities are therefore the unique positive solution to equation 

d 

Wk . sr^ .Wk,i+2-k „„^ 
Pk = Pk + > ^ i Pi- (2.36) 

W2 W2 
i=k—l 
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2.3. Explicit solutions. We discuss three simple special cases. 
1) When d = 3 we find that 



B 





W2,l 





2^1,3 
W2,2 - 2U'3,1 

2tt;3,i 





3u;3,2 




(2.37) 



If the weights satisfy the conditions in Lemma 12.31 it is easy to see that 
B is diagonalizable. For linear splitting weights Wi = ai + b and uniform 
partitioning weights the positive solution of (j2.36p is 

2 3 
Pi = P3 = y and />2 = y (2.38) 

for all values of a and b as can easily bee seen from the simple structure of 
the B in this case. 

2) When d = 4, the splitting weights linear and the partitioning weights 
uniform one can check that 



B 



■ 

a + b 





|(2a + 6) 
-i(2a + 6) 



|(2a + 6) 







i(3a + 6) 
i(3a + 6) 
-i(3a + 6) 
i(3a + 6) -i(4a + 6). 





|(4a + 6) 
|(4a + 6) 



(2.39) 



When 4a + 6 > the weights satisfy the conditions in Lemma 12. 3[ The 
eigenvalues of B are -^(33a + 136 ± Va^^^TSaF^^lW) , W2 and 0. This 
shows that B is diagonalizable except when a/b = 39 ± 16\/6. One can 
analyze these cases separately using a basis of generalized eigenvectors and 
show that the right hand side of equation (12.230 still converges to e^^ ■ 
3) Fix a maximal degree d. Choose partitioning weights 



W24 



Wi,i '■ 



: d' 



1)" 



2,...,d, 



and all other weights equal to zero. The splitting weights are then Wi = 1 
for i = 1, . . . ,d. Note that if we take the limit d — > oo we get a special case 
of the preferential attachment model. These weights satisfy the conditions 
in Lemma 12.31 The nonzero matrix elements of B are 



Bi+i,i — Bi^i — —Bi^i — i?2,i - 
The characteristic polynomial of B is 



B 



2,d 



1, I <i <d. 



p,(A) = (-1)^^(1- A) (l-(l + A)^-i) 



(2.40) 



(2.41) 



which can easily be proved by induction. The roots of the characteristic 
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polynomial are A = 1 and A = exp (^^) — 1, A; = 1, . . . , d — 1 and they are 
all distinct which shows that B is diagonalizable. The solution to (j2.36p is 

P^ = 2'^-^ - 1 ' k=l,...,d. (2.42) 

2.4. Discussion of the conditions on the weights. It is not obvious 
how restrictive the condition that B must be diagonalizable is regarding the 
collection of weights one can consider. In the previous subsection we saw 
that for (i = 3 and d = 4 the condition was not very restrictive. Also we 
saw that for every d there is at least one choice of weights which satisfies 
the conditions in Lemma 12.31 and yields a diagonalizable matrix B. We will 
now show that this guarantees that almost all weights give a diagonalizable 
B. 

Fix a maximal degree d. Let be the set of matrices B which correspond 
to partitioning weights that give linear splitting weights and satisfy the 
conditions in Lemma 12.31 It is clear that if B,B' G then 
tB + (1 - t)B' G Bd for all t £ [0, 1] and so Bd is convex. Let 

B'd = £ Bd \ ^ is diagonalizable.} . 

From the previous subsection we know that B'^^ f/i. Since Bd is convex and 
/ then by Corollary 1 in [18], B'^ is dense in Bd- 

We believe that it is possible to extend the result of convergence of the 
right hand side of ()2.23p to all partitioning weights giving linear splitting 
weights, relaxing both the condition of diagonalizability of B and condition 
(2) in Lemma 12.31 We also believe, in view of simulations, that equation 
(j2.36p even describes correctly the vertex degree distribution for non-linear 
splitting weights and for the case d = oo. In the special case of the prefer- 
ential attachement model the vertex degree distribution can be calculated 
by another method [13] • We will look at this more closely in the next two 
subsections. 

2.5. Mean field equation for general weights. To generalize equation 
()2.36p beyond the case of linear splitting weights we notice that Lemmas 12.21 
and 12.31 do not rely on the linearity of the weights except in the conclusion 
of Lemma 12.31 where we show that w = W2- We therefore conjecture that in 
general the limiting vertex degree densities are the unique positive solution 
to 

d 



Pk = Pk + > , i — ■ Pi, (2.43) 

■ =k-l 



subject to the constraints 

pi + ... + Pd = l (2.44) 
wipi + ... + WdPd = w. (2.45) 
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Recall that w is the unique simple positive eigenvalue of B defined in ()2.22p 
with the largest real part of all the eigenvalues and pk, k = 1, . . . ,d are the 
components of the associated eigenvector with the proper normalization. 

The existence and uniqueness of a positive solution to ()2.43p satisfying 
(12.44 jl and ()2.45jl follows from the Perron-Frobenius argument in the proof 
of Lemma 2.2. In order to distinguish (12.43P from (12.36P we refer to it 
as the mean field equation for vertex degree densities. One can also arrive 
directly at this equation by assuming that for large t an equilibrium with 
small enough fluctuations is established, and then performing the splitting 
procedure on this equilibrium. 

The solution to the mean field equation for the d = 3 model and uniform 
partitioning weights is 

7«- V«(a + 24/3 + 24) 
P3 = ^TTTT 3 (2-46) 



where a 



W2 



W3 



6(2a-/3- 1) 
Note that from the constraints we have pi 



and f3 = — -. iNote tnat trom tne constraints we nave pi = ps 

Wl Wl 

and P2 = ^ — 2p3. This solution (and solutions in general) only depends 
on the ratio of the weights. In Figure H] we compare the above solution to 
simulations. 




Figure 4. The value of p^ as given in (12.46P compared to 
results from simulations. Each point is calculated from 20 
trees on 10000 vertices. 



2.6. The dniax = oo model with linear weights. In this subsection we 
drop the assumption that there is an upper bound on the vertex degrees 
but we still assume that all vertex degrees are finite. If we assume that 
equation (|2.36p holds for d = oo, then it is possible to find an exact solution 
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in the case of linear splitting weights, Wi = ai + b, and uniform partitioning 
weights. Equation (12.361) becomes 

Pk = Pk+ > — — — Pi- (2.47) 

'i=k—l 

Subtracting from this the same equation for pk+i we find 

yOfc IH Pk+i 1 H = 7 Pfc-i- (2.48) 

\ W2 J V W2 J k W2 

Let X = h/a. The recursion (12.480 has the solution 

2 



Pk{x) 



where 



C(-l) 

2'^-ir (fc + x) 



L C{x)T{k)T{k + ?, + 2x) 



if X = — 1 and k = 1 
{k + I + 2x) otherwise, 



= <^'^'> 

is a normalization constant such that "^^Pi = 1- Here, lu is the modified 
Bessel function of the first kind. The variable x can take values from —1 to 
oo. The asymptotic behaviour of Pk{x) for large k is 

The special case x = oo corresponds to constant weights for which the 
solution is 

In Figure [5] we compare the above solutions to simulations for five different 
values of x. 

3. Subtree structure probabilities 

In this section we consider the model in which vertices are labeled with 
their time of creation as explained in the definition of the splitting process 
(item (iv)). For convenience we will take time to be the number of edges 
in a tree, denoted by £, starting from the single vertex tree at time and 
adding one link and one vertex at each time step. We consider only linear 
splitting weights Wi = ai + b but comment on generalizations in the next 
section. To simplify the notation we define 

W{£) = W{T)-wi = {2a + b)£-a (3.1) 

where the last equality follows from the linearity of the weights. 

We derive exact expressions for probabilities of particular subtree struc- 
tures as seen from the vertex created at a given time. By averaging over 
these probabilities and assuming the existence of a scaling limit, we shall 
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Figure 5 . The solid lines are y = k + l + 2x plotted against 
k for five different values of x. The datapoints on the graph 
are calculated from simulations of 100 trees on 10^ vertices. 
For a given k and x they are calculated from the measured 
density Pk,mei,sXx) by 



y 



, r(fc)r(fc + 3 + 2x) 

'-^ \^) ot-l-n I \ Pk,meas. \X ) 



2^-ir(/t + x 

with an obvious modification if x = — 1. 



show how to extract the Hausdorff dimension of the tree and derive bounds 
on this dimension. In special cases we give an exact expression for the 
Hausdorff dimension. 

3.1. Probabilities related to subtree structure. Consider a tree on i 
edges generated with the splitting procedure starting from the single vertex 
tree at time 0. Let pr{1]s) be the probability that the vertex created at 
time s is the root. If s < £ we find that 

PR{t, s) = - l)pn{i - 1; s\ (3.2) 

since we can split any vertex except the root in order to get from a tree at 
time £ — 1 to a tree at time £. This contributes the factor pnii — l;s) to 
Pr{1;s). Similarly, 

PR{t, t) = E - 1; (3-3) 

since if we create a new root vertex at time £. the previous root vertex, 
labelled s in (j3.3p could have been created at any time before I. We depict 
these processes in Fig. [6l 
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Figure 6. Diagrams representing equations p.2p and (13. 3p . 



If f is a vertex of order k in a tree T, then there is a unique hnk li incident 
on V leading towards the root (unless v is the root). Let £2, ■ ■ ■ ,ik be the 
other links incident on v. The largest subtree of T which contains the root 
and ii but none of the links ii with i > 2 will be called the left subtree (with 
respect to v). The maximal subtrees which contain one ij with j 7^ 1 and 
no other link ii will be called the right subtrees (with respect to v). If A; = 1 
then there are of course no right subtrees and if v is the root then we view 
the left subtree as being empty. Let pk{ii, ■ ■ ■ ,ik] s) denote the probability 
that the vertex created at time s has a left subtree on ii edges and right 
subtrees on £21 ■ ■ ■ , edges, where ii + . . . + £k = ^- By the nature of the 
splitting operation, pfc(^i, • • • , ^fc; is symmetric under permutations of 
{£2, ■ ■ ■ , £k)- We will sometimes refer to the vertex created at time s as the 
s-vertex. 

By the definition of the relabeling when we split we have 

Pi{£;£) = 0, (3.4) 

because the vertex closer to the root gets a new label and therefore no leaf 
except the root can have the maximal label. In the case s < ^ we find the 
recursion 

d-l 



^ p,{£[,...,£',;s) + 

i=l £[ + ... +£'-=£-1 



(3.5) 



The first term in the square bracket corresponds to the case when we do 
not split the vertex with label s. The second term corresponds to splitting 
the s-vertex which can have any order up to d — 1. Finally the last term 
corresponds to the special case when we have £ = 1 so the s-vertex is the 
root of the trivial tree, see Fig. [71 
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For a general k >2 and s < i the recursion can be written 

1 



Pfe(^i,...,4;s) 



W{£-l)+wi 

k 

6k2Si,iWipR{e -l;s) + Y, W{ti - l)pk{h, . . . , - 1, . . . , 4; s) (3.6) 

i=l 

d 

+ ^ + 1 - ^) Wk,i-k+2 

i=k e[+...+e[^^_^=e^-i 

see FiglHl The first term corresponds to the case when the s-vertex is the 
root before the splitting in which case we have ii = 1 and k = 2. The 
second term corresponds to the case when we split a vertex different from 
the s-vertex and the last term arises when we split the s-vertex in the step 
from time ^ — 1 to time i. Finally we have 

£-1 k d-1 

X^X] X] X] Wk,i-k+2Pi{h, ■ ■ ■ Jj-lj'l, ■ ■ ■ ■ ■ ■ ^^k] 

s=0j=2 i=k-li[ + ...+l'^^^_^ 

where ii + . . . + ik = see FigEl Here s is the label of the vertex that 
is split in the step from time ^ — 1 to time i and we sum over all possible 
degrees of the s-vertex and all ways of splitting it. 

We define the following mean probabilities by averaging over the vertex 
labels in dSSHSZD 



Pfi(^) = (3.8) 

s=0 
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and 



1 ^ 

s=0 



(3.9) 



where ii + . . . + = i From (j3.8p we get a recursion for the mean proba- 
bihties, going from time ^ to ^ + 1 



+ 1 
+ 2 



For A; = 1 we obtain from ([33]), ([33]) and ([M 



(3.10) 



(3.11) 



^ + 1 1 

i + 2 W{£) + U7i 



d-l 



^(^)Pl(^) + E 

Pi(/i,...,^:) + 25,o«)i 



i=l 
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Finally, the general case for /c > 2 is 
£ + 1 1 



i + 2 w{e) + wi 



Sk2Si,iw,pn{i) + J2 ^(^i - ^)Pki^^^ . . . , £i - 1, . . . , 4) 



i=l 
d 

+ Y,ii-k + l)wk,^-k+2 J2 Pi{i'i,---,4+i-k,i2,...,ik) (3.12) 

i=k + 
k d 

4) 

j=2 i=k-l 

where ii + . . . + Ik = i + I and we have made use of (|3.6p , (j3.7p and (|3.9I) . 

3.2. Two-point functions. One can reduce the above recursion formulas 
for the mean probabilities to simpler recursion formulas which suffice for the 
determination of the Hausdorff dimension. Define the two-point functions 

qH{hj2)= Yl E Pk{i'l,■■■J'k-^Jl,■■■J^), (3.13) 
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where k = 2, . . . ,d and i = 1, . . . ,k — l. In total there are d{d — l)/2 of these 
functions. If we define 

91,0(4,^2) = Si^oSi^iPiih +£2) 

then qki{£i,£2) is the probabihty that i right trees of total volume £2 are 
attached to a vertex of degree /c in a tree of total volume £1 + £2. By 
summing over the equations in the previous section we get 



'+1 1 



t + 2 W{£) + wi 
d 

^ Wk,j+2-k[U -i)qji{£i - IJ2) + iqj,j-{k-i){h,£2 - 1) 



qki{h,£2 



j=k-i 

+ (wi£i -l) + {k-i-l){w2- u;3))(7fc^(4 - 1,^2) 
+ (W{£2 + IKW2 - Ws)'^qk^{£l,£2 - 1) 

+Sk2Se^iwipR{£2) + 6ii6e^iWk,i ^ Pk~i{£[, ■ ■ ■ J'k-i) 

(3.14) 

with £i+£2 = £+1. We see that the two-point functions satisfy an essentially 
closed system of equations. The last two terms in p.l4p do not contribute 
to the scaling limit which will be discussed in the next section. 



4. HAUSDORFF DIMENSION 

In this section we relate the two-point functions defined in the previous 
section to the size of trees, defined in a suitable way. With the help of 
some scaling assumptions this relation allows us to calculate the Hausdorff 
dimension of the trees as a function of the partitioning weights in simple 
cases. As in Section [3] we assume that the weights Wi are linear in i and we 
shall comment on the general case at the end of the section. 

4.1. Definition. Let T be a tree with £ edges and v and w two vertices of 
T. The (intrinsic geodesic) distance dj-iv, w) between v and w is the number 
of edges that separate v from w. We define the radius of T with respect to 
the vertex v as 

w 

where k{w) is the degree of the vertex w. Notice that 2£ = k{w). The 
global radius of T is 

RT = Y^Y.^T{v)k{v). (4.2) 

V 
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We define the Hausdorff dimension of the tree, dn, from the scahng law for 
large trees 

(Rt) ~ ^^/"^^ i^oo. (4.3) 

The more usual definition of the local fractal dimension of a vertex v of 
the tree, df{v), is defined by the growth of the volume of a ball of radius r 
around v, i3„(r) = {w : d{v,w) < r} 

(Card(i3^(r))) ~ r'^f^"^ l<.r^i. (4.4) 

These two definitions are expected to coincide provided that the tree is a 
homogeneous fractal (no multifractal behaviour). We expect that the scaling 
behaviour ()4.4p is valid independently of the point v and also that the scaling 
(j4.3p holds for Rt{v) defined by (j4.ip irrespective of the point v chosen. 

4.2. Geodesic distances and 2 point functions. The radius Rt{v) can 
be extracted from the two point functions calculated in the previous section. 
Let T be a tree and v a vertex of T. Let i be an edge of T. If we cut this 
edge then the tree is split into two connected components, a tree Ti which 
contains v and a tree T2 that does not contain v (see Figure 8). Let ^2(^5 i) 
be the number of edges of T2. We have the simple remarkable result 

driv, w)k{w) = Y.{2e2{v; i) + 1) (4.5) 

to i 

which we will now prove. For the tree T with i edges, we may assign two 




ii 

Figure 10. Cutting a tree along the edge i. 

labels to every edge in the following way. Starting from v, we walk around 
the tree while always keeping the tree to the left. Drop the labels 1 to 2£ on 
the sides of edges as we pass them. 

An example of such a walk and labelling is shown in Figure [TTl Let us 
mention that the initial direction from v is unimportant. In what follows we 
will denote these new labels by greek letters. 



RANDOM TREE GROWTH BY VERTEX SPLITTING 



25 



Figure 11. A tree and its labels. 

Given 1 < a < f3 < 2i, define (pvict, (3) to be 1 if a and (3 are labels of the 
same edge, and zero otherwise. In the above example we have i;^t,(6,9) = 1 
whereas (/>^(6, 12) = 0. For any vertex w £ T, let us define u{w) to be the 
smallest label of the edges adjacent to w. In the example above lo^w) = 6 
and u;{v) = 1. 

We now have for any w £ T 

dT{v,w) = Yl Ma,P) (4.6) 

C(,/3: a<uj{w)<f3 

and it is easy to see that 

'Y dTiv,w)k{w) = ^ (pvia,(3) = ^ (py{a, I3){(3 - a). 

w&T «,/3,7: a<7</3 a,/3;a</3 

(4.7) 

If (f){a, P) = 1, i.e. if a and (3 correspond to the two faces of the edge a, 
then 

f3-a = 2i2iv,a) + l (4.8) 

and equation (j4.5p follows. 

We now apply ()4.5p by choosing for v the root r of the tree and averaging 
over all trees obtained by the splitting process. We notice that the average 
number of links giving the volume i2i''',i) is simply the number of links i 
times the proportion of vertices w which have a left tree (containing the root 
r) on ii = i — £2 edges and an arbitrary number of right trees (on a total 
number of £2 edges). We obtain 

(RTir)) = ^ E (2^2 + l)Yl 1k,k-i{i - i2,i2), (4.9) 

£2=0 k=l 

see Fig. [12] for a simple diagrammatic representation of this identity. Thus, 
if we know how the two point functions qk,i{£i,£2) scale for large i, we know 
how the radius of the tree scales with I and we can compute the Hausdorff 
dimension dn- 

4.3. Scaling and the Hausdorff dimension. We assume that the follow- 
ing scaling holds for the two-point functions q^i with £ large 

qkiih,i2) = £-P{uJk^{x)+^k^{x)r' +0{£-^)) (4.10) 
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Figure 12. Illustration of eq. 14.91 

where £i + £2 = ^, x = £i/i g]0, 1[ and where ujki,^ki ^-re some functions. It 
must hold that ZJki > and we assume that the scaling exponent p satisfies 

l<p<2. (4.11) 

Note that for i finite, the probabilities qk,i{^i-,^2) are of order when 
is of order 1 and are of order 1 when (.2 is of order 1. This implies that the 
scaling functions T^kii^) should scale when x — > or 2; — > 1, respectively, as 

ujki{x) ~ x^-P and Uki{x) ~ (1 - x)"^. (4.12) 
Using this ansatz and ()4.9p the mean radius scales as 

{RT{r))^e-PC, C= [\x{l-x)Zj{x), cj(x) = VcJfc,fc_i(x). 

(4.13) 

Equations (j4.12p and ()4.1ip ensure that the integral C is convergent when 
p < 2. Equation (j4.3p then implies that the Hausdorff dimension of the tree 
is given by 



2-p = 




(4.14) 



For p = 2 we see that C is logarithmically divergent and this corresponds 
to an infinite Hausdorff dimension. 

Inserting ()4.10p into the recursion equation (j3.14p for the two point func- 
tions and expanding in gives (dropping the function argument x in an 
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obvious way) 

= -lr^(i- "^ + '"^""^ r^ + o(r^) 



d 

x[ ^ Wk,j+2~k{{j -i)'^ji + i'i^j,j-(k-i) + 0{r^)^ 
j=k-i 

+£(W2X + i-Ws + {k-i- 1){W2 - Ws))r'^ (uJk^ + Jk^r^ + 0{r^)) 
+i(w2{l -X)+ i{w2 - W3)r^^ (LJk^ + Jm^' + 0{r^)^] . (4.15) 

The equation is trivially satisfied in zeroth order of When we go to the 
next order we see that the following must hold 

1 

(2-p)uJki = — ^ Wk,j+2-k(^ij -i)'^^ji + i^^jj-^{k-i)) -—^ki- 

(4.16) 

This eigenvalue equation may be rewritten as 

Cu = W2{2-p)u (4.17) 

where C is a (2) x (2) matrix indexed by a pair of two indices ki with 
k > i, k = 2, . . . ,d and a; is a vector with two such indices. The matrix 
elements of C are 

Cki,jn = Wk,j+2-k{{j -i)Sin + iSnJ-{k~i)) -WkSkjSin- (4.18) 

We use the convention that Wij = if i or j is less than 1 or greater than 
d. Thus, W2{2 — p) is an eigenvalue of the matrix C and the associated 
eigenvector must have components > 0. We now show that there is in 
general a unique solution to this eigenvalue problem. 

Since the only possibly negative elements of C are on the diagonal we 
can make the matrix non-negative by adding a positive multiple 7 of the 
identity to both sides of (I4.17P and choosing 7 large enough. 

If enough of the weights Wij are non-zero {wi^i > for 2 < i < d and 
Wj^3 > for 2<j<d— lis for example sufficient) then one can check that 
the matrix C + 7I is primitive. Then, by the Perron-Frobenius theorem, 
it has a simple positive eigenvalue of largest modulus and its correspond- 
ing eigenvector can be taken to have all entries positive cf. Lemma 12.31 
Therefore this largest positive eigenvalue gives the p we are after. 

4.4. An upper bound on the HausdorfF dimension. We can get an 

upper bound on /) by a straight forward estimate from (I4.16p . The off- 
diagonal terms in the sum are all non-negative so we disregard them and 
get the inequality 
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p < 2-(/t^-^V A: = 2,...,d. (4.19) 



W2 W2 ) 

Since 1 < p < 2 and for A; > 3 

fc+i 

2 



k 

Wk = kwk,2 + ^ 22 '^i,k+2-i > kWk,2 (4.20) 
1=1 

the best we can get from this upper bound is when k = 2 which yields 

p < 2- ^^'^"^^^'^ (4.21) 

W2,2 + 2wi,3 

Now, 2 — p = -j^ and therefore, if W2,2 > 2wi^3, we obtain the upper bound 

dn < !^M±|^. (4.22) 

W2,2 — 2Wi^3 

If W2,2 ^ 2wi^3 the upper bound in (|4.21|) gives no information about the 
Hausdorff dimension. 

It is easy to verify that (14.221) is an equahty if we choose the weights such 
that Wij = if i 7^ 1 or j 7^ 1 with the exception that W2,2 > 0. This 
condition means that we only allow vertices to evolve by link attachment, 
except that we can split vertices of degree 2. With this choice the matrix C 
is lower triangular and we can simply read the eigenvalues from the diagonal. 
Note that C is not primitive in this case and therefore we cannot use the 
Perron-Frobenius theorem to determine which eigenvalue gives the scaling 
exponent. However, with these simple weights one can show explicitly that 
there is precisely one eigenvector with strictly positive components and the 
corresponding eigenvalue is the one that saturates the inequality in (j4.22p . 

Also note that with this choice we have set = and since the weights 
are linear, Wi = ai + b, we have fixed a and b so that Wi = 1 — 2- Therefore 
there is only one free parameter which we can choose to be 1^2,2 • Then we 
can write the Hausdorff dimension as 

l-2/d , , 

with 

^(1 - 2/d) < W2,2 < 1 - 2/d. (4.24) 

We see that for any d the Hausdorff dimension can vary continuously from 
1 to infinity. 
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4.5. Explicit solutions and numerical results for dmax = 3. When the 
maximal degree is d = 3, the spUtting weights are taken to be linear 
Wi = ai + b and the partitioning weights uniform, it is easy to solve equation 
(j4.16p for the Hausdorff dimension . Since the solution only depends on the 
ratio of the weights there is only one independent variable and we choose it 
to be y := W3/W2 where < y < 2. The solution is 

i„ = 3(1 + y/r+W) ^ 

In Figure [13] we compare this equation to results from simulations. The 
agreement of the simulations with the formula is good in the tested range 
0.5 < y < 2. For smaller values of y the Hausdorff dimension increases fast 
and one would have to simulate very large trees to see the scaling. 

r , 1 , 1 , 1 , 1 1 



2.5 - 



■o 

2 - 



1 I < 1 < 1 < 1 < 1 1 

0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 

y 

Figure 13. Equation (j4.25p compared to simulations. The 
Hausdorff dimension, dn, is plotted against y = w^/w2- The 
leftmost datapoint is calculated from 50 trees on 50000 ver- 
tices and the others are calculated from 50 trees on 10000 
vertices. 



4.6. Hausdorff dimension for general weights. 

4.6.1. General mean field argument. Our argument to compute the Haus- 
dorff dimension relies on the recursion relations for the substructure proba- 
bilities, studied in Section 3, which are valid only when the splitting weights 
Wi are linear functions of the vertex degree i {wi = ai + b). In this case the 
total probability weight W(T) for a given tree T depends only on its size i 
(number of edges) and mean field arguments can be made exact. 
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In the general case where the Wij are arbitrary and the Wi are not hn- 
ear with i, these recursion relations are no more exact. We can use a 
mean field argument and assume that they are still valid for large "typi- 
cal" trees, provided that we replace in these recursion relations the exact 
weights W{i) + wi = W(T) by their mean field value for large trees 

w{i) +wi ^ wil) = WjUi+ij (4.26) 



where ne+ij is the average number of z-vertices in a tree with i edges, studied 
in Section 2. From the mean field analysis of Section 2.5, we expect that 
these scale with £ as 

ne+ij ~ £pj (4.27) 

with the vertex densities pj given by the mean field equations (j2. 4312. 4412. 45p 
as the components of the eigenvector p associated to the largest eigenvalue 
w of the matrix B. Thus the mean field approximation amounts to replacing 

W{£) + wi ^ W{e) = wi + --- (4.28) 

in the recursion relations of Section 3, in particular in the recursion relation 
(j3.14p for the two point function qki- 

With this assumption, we can repeat the scaling argument of Section 4.3, 
and we end up with equation (4.16), with the normalisation factor in the 
r.h.s. replaced by the mean field normalisation factor ^ 

1 

(2 - /)) tZJfcj = — ^ Wkj+2-k(ij - i^^ji + Wj,j-{k-i)) - —^ki ■ (4.29) 

This equation is still an eigenvalue equation of the form 

Cu = w{2- p)uj (4.30) 

where C is the (2) x (2) matrix with coefficients given in (j4.18p . 

If we denote by x the largest eigenvalue of this matrix C and if w is the 
largest eigenvalue of B then the Perron-Frobenius argument can be applied 
to show that x is non-negative and that the eigenvector a? has non-negative 
components, which is a consistency requirement for the argument, since the 
LOki are rescaled probabilities. We end up with a mean field prediction for 
the HausdorfF dimension of the simple form 

dH = -^ = -. (4.31) 
2 - /> X 



4.6.2. General solution for d = 3. In the d = 3 case (trees with only A; = 1, 2 
and 3 vertices), the B and C matrices are 
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B 



■ 




2u;3,i 

W2,2 - 2^3,1 





3^^3,2 





and we find 



W2,2 - 2l«3,l 
■W3,l 
2^3,1 



2'W3,2 






^^3,2 






(4.32) 



cIh 



{W2,2 - 2tU3,i) + VCW2,2 - 2W3,l)^ + 8^3,1(^2,! + 3^3,2) 
{■W2,2 - 2W3,l) + ^(^^2,2 - 2W3,l)2 + 16^3,1^3,2 



(4.33) 



4.6.3. Numerical test of the mean field prediction. We have tested our pre- 
diction when d = 3 and the partitioning weights Wij are uniform. In this 
case the 14.331 becomes 

<i« = "-V- l^ + ^^ + ^ m (4.34, 

a — \J a (a + 16 /3) 

^2 ^3 

where a = — and /? = — . In Figure [TH we compare this equation to 

Wl Wl 

results from simulations. There is a good agreement for small values of a 
and not too small values of /?, but the precision of the numerics becomes 
poor for large values of a and small values of /?. This could be expected 
since in this case, the trees will have a large Hausdorff dimension and finite 
size effects are expected to be large, so that one must go to very large trees 
to see the scaling. Clearly a better estimate of finite size effects and large 
simulations are desirable. 



5. Correlations 

5.1. Vertex- vertex correlations in the linear case. Consider a tree on 
£ edges generated by the splitting procedure starting from the single vertex 
tree at time 0. We are interested in determining the density of edges which 
have endpoints of degrees j and k in the limit when £ — > oo. Denote this 
density by pjk, assuming it exists, and for convenience we let the vertex of 
degree j be the one closer to the root. Therefore pik = for all k and in 
general pjk / Pkj- 

To arrive at these densities we use the same labelling techniques as in 
Ssection [3l We start by defining 

Pjki^lj • • • ) -^j-l j ^1) • • • 5 ^k-l'i ^) 

as the probability that a vertex created at time s is of degree k and has 

. . . , l'l^_i right trees and that the vertex left to it is of degree j with an 
left tree and ^2, . . . A'j-i right trees (excluding the right tree which contains 
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Figure 14. Equation (j4.34p compared to simulations. Each 
datapoint is calculated from 50 trees on 10000 vertices. 



s). Note that it is symmetric under permutations of both {£'2, . . . and 
{i'l,...,e'U) and 

We use the methods of Section [3] to derive recursion equations for 
Pjki^i, ■ ■ ■ , ^j-i! ^1) • • • ) ^k-v ^) linear splitting weights. We then average 
over the label s as before and get recursions for the average probabilities 
Pjki^i, ■ ■ ■ ,£'j-i',ii, ■ ■ ■ , ^k-i)- All equations and diagrams are given in Ap- 
pendix[Xl Finally we define the densities Pjki^) by averaging out the volume 
dependence of the average probabilities 

and similarly we denote the vertex degree density by 



pj{i) = Pi,j = pj{£i,..., £j), 

ei+...+ej=e 
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cf. Section 12.21 We have the fohowing recursion for the densities 

W\w{t)+w\ ^^^'^ - -Wk + 2WI - W2)pjk{() + (i - l)Wj^kPj + k-2{^) 
d d 

+ (j - 1) X] '^j,i+2-jPik{£) + {k-l) ^ Wk,i+2-kPji{i) 
i=j—l i=k—l 



fori, j > 1, see AppendixEI. Now assume that Pjk{f-) = Pjk+fjk^~^+0{l~'^) 
and that a similar expansion holds for Pj{i). Expanding the above recursion 
in gives 

P,k + r,,r^ + oin = fi- "^ + ^"^-"^ r^ + or^) 

\ W2 

1 + ""^•""^ + '"^""^ r^) {P,k + r.ki-^ + o(r^)) 

W2 I \ J J ) 



W2 



{j -l)wj,k{Pj+k~2 + 0{i-^)) + (j-1) ^ u;j-i+2-j (pifc + 0(ri)) 

«=i-i 



-{k-1) J2 Wk,^+2-k {PJ^ + 0{r^)) \. 

i=k-l ^ ' 



This equation is trivially satisfied in zeroth order oi £ ^ . When we go to the 
next order we get the following equation for the limits of the densities 

Pjk = Pifc + U - + U - 1 > — -Pik 

W2 W2 W2 

i=j-l 

+{k -DE ^^^p,. (5.1) 

i=k-l ^ 

We can also arrive directly at this equation by assuming that for large (. an 
equilibrium with small enough fluctuations is established, and then perform 
the splitting procedure on this equilibrium. With the same methods, it is 
possible to derive an equation like (jS.lh for the density Pji,j2,---,jR of linear 
paths of length R—1 directed towards the root containing vertices of degrees 
ji, ■ ■ ■ Jr- 

Existence of solutions to equation (jS.ip can be established by the Perron- 
Frobenius argument like in the previous sections. In the following subsec- 
tions we will find an explicit solution for linear weights and discuss general- 
izations for non- linear weights. In both cases we compare the results with 
simulations. 
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5.2. Solution in the simplest case. When d = 3, the sphtting weights are 
hnear and the partitioning weights are uniform, we know that pi = ps = 2/7 



and p2 = 3/7, see Section 
equation (15. ip are 



Let y = wz/w2- Then the solutions to 



P2l 



P22 



P23 



4(3 -j/) 
7(11 -2y)' 

42/2 - l2y + 105 
7(2y + 7)(ll-2y)' 

2(- V + 20y + 21) 
7(2y + 7)(ll-2y) ' 



10 



P31 



P32 



P33 



7(11-22/)' 

2{-8y^ + I8y + 63) 
7(2y + 7)(ll-22/) ' 

8(32/ - 14) 
7(22/ + 7) (22/- 11)- 



(5.2) 



Note that the following sum rules hold for the solutions 

P21 + P31 = Pi = 2/7 

P22+P32 = P2 = 3/7 

P23+P33 = P3 = 2/7, 

P21+P22+P23 = P2 = 3/7 

P31+P32+P33 = 2p3 = 4/7. 



(5.3) 



These relations show that there are only two independent link densities. We 
plot p2i and P22 in Figure [15] and compare to simulations. 



P21 

P22 



0.15 - 



0.05 I < < < < 1 

0.5 1 1.5 2 

y 

Figure 15. Two independent solutions given in ()5.2p plot- 
ted against y = w^/w2 and compared to simulations. The 
two leftmost datapoints on each line come from simulations 
of 50 trees on 50000 vertices. The other datapoints come 
from simulations of 50 trees on 10000 vertices. 
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We can compare the above results to the case when no correlations are 
present. Denote the uncorrelated densities by pij. Then we simply have 

_ PiPj 



I -Pi 

The denominator comes from the fact that the vertex closer to the root is 
of degree one with probability zero. Inserting the values from (|5.3p into this 
equation gives 

~P2i = 6/35, P31 = 4/35 
P22 = 9/35, p-i2 = 6/35 
P23 = 6/35, P33 = 4/35 

showing that in general pij ^ pij and so correlations are present between 
degrees of vertices. 

5.3. Results for non-linear weights. We can generalize equation (|5.ip 
to a mean field equation, valid for arbitrary weights, by replacing W2, where 
it appears in a denominator, with w as we did with the equation for vertex 
degree densities in Section [2j For d = 2> and uniform partitioning weights 
the two independent densities p2i and p22 are given by 

= 1 (3 + /?) (7a -7) , . 

3 (2a-/3-l)(3a + 2/3 + 7 + 6) ^ ' 



P22 = Y (284 a2/3''7 - 177 a^/37 + 3564 + 18 0^7 + 161a /3^7 - 873 7 + 11979 

-2259 - 39 a®/3 - 207 a^-y + 6516 - 5205 a^/J - 1419 a'^jSy + 996 0(3^ 

-5994 a* - 892 a'^/S'^'y + 1543 a^p'^ - 18 a'^ - 668 + 324 a'^y + 909 a/3^7 

-2600 a^fS'^ - 975 + 222 0/3^ - 1533 a^/3^7 + 10206 a^jS^ - 11799 a'^fS 

-5300 a*/3^ - 1521 0^/37 + 1899 a^fd^y + 1059a^ /3^7 + 1269 a^P^ + 32400^ (3 
+756 af3^ + 4860 + 6 /3^7 - 11703 a^/J^ + 17280^ /37 - 162 0=^7 + 486a fS'^y 
+18 f3'^-y + 1530 a/?"' + 624a /3*7 - 772 a^/3^7 - 9 a*' + 24 /3^7) / ((3a + 2/3 + 7 + 6) 

X (11 a^ +25a/3 + 5a7 + 3/37 + 12a + 4/32) (-a + 7) (1 - 2a + /3) {7a + 2/3 + 7)^ ) 



(5.5) 



where a = — , /? = — and 7 = \/a (a + 24 /3 + 24) . These solutions 

are compared to simulations in Figures [16] and [T71 The other densities are 
obtained by using the sum rules (|5.3p . 
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Figure 16. The solution (|5.4p for the density p2\ plotted 
as a function of /? for a few values of a. Each datapoint is 
calculated from simulations of 100 trees on 10000 vertices. 
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Figure 17. The solution (|5.5p for the density p22 plotted 
as a function of /? for a few values of a. Each datapoint is 
calculated from simulations of 100 trees on 10000 vertices. 
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6. Discussion 

In this paper we introduced a simple but general model of random tree 
growth by vertex splitting. The probability weights Wij for each splitting 
process can be chosen independently and are the parameters of the model. 
We study the large time/large tree size asymptotics of this model. A mean 
field theory is presented for the vertex degree distributions and we deduce 
master equations for various correlations functions, like subtree structure 
probabilities and vertex degree correlations. The mean field results are 
proved in some special cases and checked by numerical simulations in many 
general cases. Under a natural scaling hypothesis we are able to compute 
the intrinsic Hausdorff dimension dn of the tree. This Hausdorff dimension 
may take values between dfj = 1 and dfj = oo, depending on the parameters 
Wij. These predictions for dn are checked by numerical simulations. 

There are still many open questions for the splitting model presented 
here. The most urgent one is to prove rigorously that the mean field theory 
is valid in the general case where the splitting weights Wi are not linear with 
the vertex degrees i. Another goal is to get a better understanding of the 
degree-degree correlations for vertices which are at a finite distance greater 
than one, and of the general substructure probabilities. It would also be very 
interesting to obtain general distribution functions for correlations involving 
the geodesic distances on the trees, and more than two points. 

As already mentioned in the introduction, the model considered here is 
related to other tree growth models. Firstly, it contains as a very special 
subclass the so called preferential attachment models, which have been ex- 
tensively studied in the literature (see e.g. [H, [3]). This model consists in 
allowing only growth by attaching a new vertex by an edge to an already 
existing fe-vertex, with an attachment rate Wk depending only on the degree 
k of the initial vertex. In our model this corresponds to allowing only the 
splitting processes 

k ^ l,k + l , 
and to set all the splitting weights to zero except the 

Wk 

Wk+l,! = Wl,k+1 = -j^ ■ 

The preferential attachment model is not covered by our Lemma 12.31 (since 
condition (2) is not satisfied). But the preferential attachement model, in 
particular the vertex degree distribution pk can be studied by different meth- 
ods (see e.g. [20]). It can also be recovered as a limiting case of our model, 
by letting the Wij — > if i or j > 1 in our solution for the pk, and eventually 
taking also as maximal degree d = d^i^^ = oo. In this limit we also recover 
for the Hausdorff dimension 

dn ^ oo 



as expected. 



38 



F. DAVID, W.M.B. DUKES, T. JONSSON, S.O. STEFANSSON. 



Secondly, it also contains as a special limit case the so called alpha-model 
introduced by D. Ford [3], in connection with phylogenetic trees and clado- 
grams. This model is related (but is in general not equivalent) to the so 
called beta-models introduced by Aldous [3, H]- The alpha-model has been 
studied in [l^, 22] and generalised into the alpha- gamma model in (a 
mixture of the alpha-model and of the preferential attachment model). It is 
beyond the scope of this paper to discuss in detail the relationship between 
these different models, as well as the motivations for them. However, we 
discuss briefly the basic idea. 

The alpha model is a growth model for rooted binary trees, with only 
1-vertices (the leaves) and 3-vertices (internal vertices), so that ni = + 
1 (excluding the root). The growth process grafts a new leaf to a given 
edge, with probability weight a if the edge is internal (connects two internal 
vertices) and weight 1 — a if the edge is external (connects a leaf to an 
internal vertex). Thus at each step a leaf and an internal vertex are created. 

This leaf-edge attachment process can be easily reproduced in our vertex- 
splitting model by considering the general model with d = 3 (we allow only 
1-, 2- and 3-vertices) and by taking the weight w^^i — > -|-oo. Indeed, in this 
case, if we start from a binary tree T with no 2-vertices, any splitting will 
produce a tree T' with a 2-vertex, either by splitting a leaf into a 1-vertex 
and a 2-vertex (with probability weight W2,i), or by splitting an internal 
vertex into a 3~vertex and a 2-vertex (with probability weight ^3,2)- But 
since w^^i = +00, at the next step, with probability 1, the new 2-vertex will 
split into a 1-vertex and a 3-vertex, thus producing a new binary tree where 
a new 1-vertex has been grafted on an edge. These two two-step processes 
are depicted in Fig. [18l For large binary trees, the density of 1-vertices 




Figure 18. The two step processes in the d = 3 model which 
reproduce the grafting processes of the a-model. 

and 3-vertices are equal (pi = ps = 1/2, P2 = 0), as well as the density of 
internal and external edges {ps^i = ps^s = 1/2). It is easy to see that our 
model for d = 3 and W3^i = +00 should be equivalent to the a-model with 
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the choice 

a = ■ . (6.1) 

As expected, u)2,2 is irrelevant in the hmit w^^i — > +oo. This equivalence 
can be checked explicitly. For instance it is proven in that the so-called 
Sackin's index S{t) scales with the size t of the binary tree as S{t) ~ t^^". 
Here S{t) is defined as the sum over the leaves v of the number of edges 
between v and the root (minus 1). It is therefore related to the radius of 
the tree, and we expect it to scale as t^+'^/d-a ^ where dn is the Hausdorff 
dimension of the tree. Therefore we expect that 

a = l/dn (6.2) 

in the a-model. This is indeed the case, taking the limit ^3^1 +00 in the 
general formula (14.330 for dfj in the d = 3 model, we obtain 

hm rf, = !f!^i±^ . (6.3) 

1113,1— >oo 21/732 



The alpha-gamma model of is a model of trees which encompasses 
both growth by grafting of leaves on existing branches (like in the alpha 
model) and grafting leaves on existing nodes (like in the attachement mod- 
els). It is very easy to see that the model presented here contains also the 
alpha-gamma model of but we shall not elaborate further on this point. 



Another class of models are the tree growth models considered in [1^ 
which are equivalent to arch deposition models studied in connection with 
random bond RNA folding problems (in the greedy approximation scheme). 
These last models are slightly more complex, since they involve a two step 
splitting process, and their dynamics (i.e. the splitting weights) is fixed by 
the geometry of RNA folding (the splitting weights cannot be choosen ar- 
bitrarily). Variants of these models exhibit universal properties: the fractal 
dimension is always found to be dn = (\/l7 + 3)/4 and some scaling func- 
tions are universal. It would be interesting to study correlations for these 
models with the methods presented here. 

Both classes of models (the alpha and alpha-gamma models and the RNA 
models) are related to fragmentation trees and to coalescent trees, that is 
trees which appear in continuous time processes. It would also be interesting 
to study our model by continuous time methods. Another important ques- 
tion is universality: we have seen that our model, like the models of [16] and 
[1, 0] gives trees with continuously varying scaling exponents. In particular 
we can obtain dn = 1, dn = 2 and dn = 00. We do not know, however, 
whether the trees that we obtain in this case have the same scaling limit 
(scaling exponents and correlation functions) as the trees obtained in for 
a = 1, 1/2 an respectively (the Comb, the Uniform and the Yule models 
of phylogenetic trees) or if our model gives a much richer universality class 
structure. Finally the trees studied here are symmetric, in the sense that 
the "time ordering" of the vertices, or their distance from the root, does not 
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seem to be crucial, since the growth rule by splitting is local and geomet- 
ric. It should be interesting to understand the similarities and the difference 
with the directed trees considered in (for instance) directed polymers and 
evolving population problems (see e.g. and reference therein). 
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Appendix A. Illustration of recursion equations for 

CORRELATIONS 
To make the notation more compact we will write for i < j 



d-i j — -^2, . . . , ^j, and I — ~\~ ' ' ' ~\~ ' 



We can write the following recursions for going from time ^ — 1 to time £. 
Note that s < i in (IAB . (|A4j) and ([0|l . 



^ ^ i—l 

+ 6k2WipR{£ - l;s) + deiSkiWij. 

(A.l) 

pik{e'l,k-i-J) = 

^ i-l d 

WU-D+W ^ ^ (k - l)wk,r+2-k Pl^{K^+l-k,^l.k-l■,s). 

(A.2) 
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1 ,^-1 



w{e-i) + wi 



(A.3) 



i- 1 



E Pii(^~i,»+i-j,^2j-i;s)), 
K~i,,+i-jl=fi-i 



(A.4) 




Figure 19. Illustration of equation (jA.ll) . 



In deriving equations (|A.4p and (|A.5P and the corresponding figures, note 
that the index p is introduced in the second last diagram in each figure. The 
reason for it is the following: even though the functions Pji{i'i, ■ ■ ■ j^'j-i'i s) 
and Pjki^i, • • • 1 • • • , ^fc-i! s) are symmetric under permutations of 

{£'2, . . . , it does matter where the edge going from s towards the root, 
is located. 

Therefore, we group together the balloons counter-clockwise from s to- 
wards the rooted balloon and the balloons clockwise from s towards the 
rooted balloon into another group, one of the groups is possibly empty. If 
the total number of balloons in the groups is i — 2 then there are i — 1 such 
possible configurations. In the equations we therefore divide by i — 1 and 
sum over all the configurations which contribute to the configuration on the 
left of the equality sign. 
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The index p in the sum is the location of s counter-clockwise from the 
rooted balloon. Note that p can be no larger than j — 2 since if it were larger, 
there would be no space for the balloons inside the dotted circle. Note that 
the balloons inside the dotted circle are always drawn clockwise from the 
vertex s. To count the possibility that they are counter-clockwise from s we 
multiply by 2. 



+ (j - i)w'i,fePj+fe-2(^ij_i,^i,fc-i;s) 



E/ i~l E/ E/ E/ Piki^l,n-lT^14+i-jT^'n+lJ-li^l,k-li^) 

' (j - 1) E „ 1 E p»fe(^i,»+i-i'4,j-i;^"fc-i;s); 



1 



(A.5) 
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Figure 21. Illustration of equation (|A.4[) . 



1 



W{l-l) + wi 

£-1 k-1 d 

|£i,,+i_fc|=£;;-i 



s—O n—1 i—k — 1 



(A.6) 
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Figure 22. Illustration of equation (|A.5p . 
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Now average over the label s as before and get the following recursion, 
going from time i to i + 1 



£ + 2 w{e) + wi 



fc-1 



d 



5ifi5kiwi+ (fc-1) E Wk,i+2-k E pu{ii^i+i-kAi,k~i) 



Pji(^'ij-i) = 

J— 1 

+ (■^' ~ ^) E \ f_ 1 E ^'^l(^~l,^+l-J>4J-l))• 
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1 1 



fe-i 



+ (i - i)^«j,fePj+fe-2(^'i,,_i,^i,fc_i) 



j=j-l 



+ (i- i)E ^ . j^^^ wj,i+2-j E p<fc(^i,i+i-j'4,j-i;^i,fe-i) 

^l,n-l5 ^n+l,fc-l) 

n=li=/s-l Ki,i+i_fc|=^;'-l 
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